Parallel time integration using Batched BLAS (Basic Linear Algebra Subprograms) routines
نویسندگان
چکیده
We present an approach for integrating the time evolution of quantum systems. leverage computation power graphics processing units (GPUs) to perform integration all steps in parallel. The performance boost is especially prominent small medium-sized devised algorithm can largely be implemented using recently-specified batched versions BLAS routines, and therefore easily ported a variety platforms. Our PARAllelized Matrix Exponentiation Numerical Time (PARAMENT) implementation runs on CUDA-enabled units. Program Title: PARAMENT CPC Library link program files: https://doi.org/10.17632/zy5v4xs89d.1 Developer's repository link: https://github.com/parament-integrator/parament Licensing provisions: Apache 2.0 Programming language: C / CUDA Python Nature problem: Time-integration Schrödinger equation with time-dependent Hamiltonian systems Hilbert space but many time-steps. Solution method: A 4th order Magnus integrator, highly parallelized GPU, subset functions improved portability.
منابع مشابه
PB-BLAS: a set of parallel block basic linear algebra subprograms
We propose a new software package which would be very useful for implementing dense linear algebra algorithms on block-partitioned matrices. The routines are referred to as block basic linear algebra subprograms (BLAS), and their use is restricted to computations in which one or more of the matrices involved consists of a single row or column of blocks, and in which no more than one of the matr...
متن کاملLightning Talk: Creating a Standardised Set of Batched BLAS Routines
One trend in modern high performance computing is to decompose a large linear algebra problem into thousands of small problems that can be solved independently. For this purpose we are developing a new BLAS standard (Batched BLAS), allowing users to perform thousands of small BLAS operations in parallel and making efficient use of their hardware. We discuss and introduce some details about how ...
متن کاملTowards Reversible Basic Linear Algebra Subprograms: A Performance Study
Problems such as fault tolerance and scalable synchronization can be efficiently solved using reversibility of applications. Making applications reversible by relying on computation rather than on memory is ideal for large scale parallel computing, especially for the next generation of supercomputers in which memory is expensive in terms of latency, energy, and price. In this direction, a case ...
متن کاملAn Extended Set of Fortran Basic Linear Algebra Subprograms
This paper describes an extension to the set of Basic Linear Algebra Subprograms. The extensions are targeted at matrix-vector operations which should provide for efficient and portable implementations of algorithms for high performance computers. An Extended Set of Fortran Basic Linear Algebra Subprograms Jack J. Dongarra † Mathematics and Computer Science Division Argonne National Laboratory ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Physics Communications
سال: 2022
ISSN: ['1879-2944', '0010-4655']
DOI: https://doi.org/10.1016/j.cpc.2021.108181